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-Abstract- 

In the dynamic tree problem the goal is the maintenance of an arbitrary n-vertex forest, where the 
trees are snbject to joining and splitting by, respectively, adding and removing edges. Depending 
on the application, information can be associated to nodes or edges (or both), and queries might 
require to combine values in path or (sub)trees. 

In this paper we present a novel data structure, called the Depth First Tour Tree, based on a 
linearization of a DFS visit of the tree. Despite the simplicity of the approach, similar to the ET- 
Trees (based on a Euler Tour), our data structure is able to answer queries related to both paths 
and (sub)trees. In particular, focusing on subtree computations, we show how to customize the 
data structure in order to answer queries for three distinct applications: impact of the removal of 
an articulation point from a graph, betweenness centrality and closeness centrality of a dynamic 
tree. 
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[T] Introduction 

In the dynamic tree problem the goal is the maintenance of an arbitrary n-vertex forest, 
where the trees are subject to joining and splitting by, respectively, adding and removing 
edges. Depending on the application, information can be associated to nodes or edges (or 
both), and queries might require to combine values in path or (sub)trees. 

The dynamic tree problem has several applications, ranging from network flows ismansi 
I26j . one of the original motivations, to other graph algorithms including connectivity m, 
biconnectivity m, and minimum spanning trees mm, and other combinatorial prob¬ 
lems nano]. With such a wealth of applications, it is not surprising the fact that there 
are several approaches to solve (at least partially) the dynamic tree problem using 0{logn) 
time per operation: ST-trees [5HI2S]) ET-trees [T71|5S], topology trees [TTJIH1II3]) top 
trees 013117], RC-trees 00, and Mergeable Trees [TS] that build up on the ST-tree and, 
as the name suggests, support also the merge operation. All these approaches map a generic 
tree into a balanced one, and can be divided into three main categories: path decomposition 
(ST-trees, Mergeable Trees), tree contraction (topology trees, top trees, RC-trees), and 
linearization (ET-trees); refer to the dissertation of Werneck pS] and the experimental 
comparison of Tarjan and Werneck m for a more complete picture about techniques and 
applications. 
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Dynamic subtree queries revisited: the Depth First Tour Tree 


Approach. In this paper we present a novel data structure, called the Depth First Tour 
Tree (DFT-Tree), to solve the dynamic tree problem; the DFT-Tree, as the ET-Tree, is 
based on a linearization: as the name suggests, we linearize the tree following a DFS visit of 
it (see Figure]^ where is shown for comparison also the Euler Tour). The main consequence 
of this approach is that the whole subtree of a node is stored contiguously, thus allowing us 
fast operations on the subtree, as we will detail in the rest of the paper. As we can see from 
Figure for example, the subtree of node 4 is contiguous in the DFT-Tree, whilst node 
4 itself appears twice in its own subtree in the corresponding ET-Tree. DFT-Tree data 
structure can be easily implemented on top of any Balanced Binary Search Tree (BBST), 
such as Splay Trees [25] and Red-Black Trees [Sj. 

The idea of linearizing the tree according to its DFS visit and maintaining the linearization 
in an efficient data structure is not new in the literature. Indeed, the very idea was exploited 
in other works, most notably IIH1IIIII22], in the context of succinct trees. However, given 
the additional constraint of succinctness, the focus of these works is inherently different, and 
the set of supported queries is weaker and less oriented to data-processing operations. 

The DFT-Tree supports all the operations shown in Table that are divided in three 
groups: i) structural operations, i.e. the ones that alter the structure of the tree, ii) structural 
queries, and iii) operations related to the values stored in the vertices; as we can see, it 
supports all the traditional dynamic tree operations together with others, such as LCA and 
CONDENSE, that are not completely standard and, thus, not supported by all the data 
structures; CONDENSE, in particular, allows to use the DFT-Tree to implement the Block 
Forest structure, following the exact algorithm of Westbrook and Tarjan m- 

Furthermore, the DFT-Tree supports three non standard generic operations, to be 
customized depending on the applications, that are: 

H COMBINe(u), that aggregates values in the path between vertex v and the root of the tree; 

H REDUCE-CHiLDREN(ri), that aggregates values of the children of v\ 

H REDUCE-CHiLD-SUBTREES(u), that aggregates values in the subtrees rooted in the children 
of V. 

These generic functions are, probably, the most interesting aspect of DFT-Trees. 

Contribution. We propose a novel data structure, combining the simplicity of the Euler 
Tour trees with the expressiveness of the Depth First visit of a tree. We believe that the 
contribution of our approach is twofold: 

H the resulting data structure is simple, using only elementary concepts, and thus is easy 
to understand, analyze and implement; 

H we give a unified framework for treating a vast class of data aggregation tasks on subtrees. 
While our data structure is able to support basic operations on paths, it is primarily designed 
to aggregate data on subtrees, an operation which is usually non-trivial with other data 
structures. 

Unlike ST-trees, topology trees and RC-trees, DFT-Trees do not require the underlying 
forest to have vertices with bounded (constant) degree in order to efficiently cope with subtree 
queries. Degree restrictions can be avoided by ternarizing the input forest but, as observed 
in 120],“ this introduces a host of special cases” and complicates the data structure. In the 
special case of ST-trees, some work has been done m to support queries on subtrees for a 
restricted set of operations (for example, giving the minimum element of a given subtree) 
without the need for ternarization, but the resulting data structure is still very complicated, 
both to analyze and implement. The same task can be performed extremely easily with 
DFT-Trees. 

Furthermore, DFT-Trees can naturally aggregate on all the children subtrees of a node 
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Euler tour: 

4|io|4|2|5|2|i|3|g|3|7|3 


Depth first tour and parenthetical sequence: 


1 2 4 

( ( ( 


10 10 4 5 
( ) ) ( 


7 7 3 

( ) ) 


M Figure 1 An example of Euler Tour, Depth First Tour and parenthetical sequence of a tree 
(introduced in Section®. 


V in parallel without having to pay a cost proportional to the degree of v itself: for example, 
as we will see, given a node v it takes O(logn), independently from the degree of u, to answer 
the child of v whose subtree is the largest. This is an interesting feature that distinguishes 
our data structure, and can be useful for practical problems, as we will demonstrate in the 
final sections of this paper. 

The extreme flexibility of use of the structure comes at the cost of its structural rigidity. 
In particular, while all other structural operations require logarithmic time in the forest size, 
the EVERT operation requires a cost proportional to the depth of the node being everted. 
However, when either the number of eversions is small compared to the total number of 
queries performed, or the costs of the eversion is amortized, the cost of evert can be 
regarded as being O(logn) like all the other structural operations. This is the case in all the 
applications we present. 

Applications. In order to explain the versatility of the approach, we show how to customize 
the above functions for three distinct applications, based on subtree computations: 

H Given a streaming graph, for which we maintain all the biconnected properties using the 
mentioned approach of Westbrook and Tarjan, we can also compute the impact of an 
articulation point u, introduced in the context of the Autonomous Systems (AS) graph, 
as a measure of the resiliency of the network. The impact of u is defined as the number of 
vertices that gets disconnected from the main connected components after the removal of 
u. This application requires the determination of the subtree of a node having maximum 
size. 

H The betweenness centrality of a vertex r; in a tree. This requires to count the sum of the 
squares of the sizes inside subtrees. 

H The closeness centrality of a vertex n in a tree. This requires the sum of the distances to 
every node in the subtree and in the tree above v. 

In each of the above applications, the query on a vertex can be executed in time 0(logn) 
for an n-vertices dynamic forest. 

This paper is organized as follows: we conclude this section by recalling few preliminary 
notions. In Section we describe the main ideas of the DFT-Tree, detailing the operations 
related to subtrees and paths in Section In Section we show how to customize the 
generic operations of the DFT-Trees in order to support the applications listed above. 
Due to space constraints, we omit the proofs and low-level details such as the extensions 
of the operations to (edge-)weighted forests. More details about the implementation of the 
DFT-Trees operations can be found, together with the pseudocode, in the Appendix. 
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Operation 

Complexity 

Description 

link(u, v) 

C>(logn) 

Makes the root of the tree containing vertex v 

a child of vertex u. 

cut(u) 

C>(logn) 

Deletes the edge connecting v to its parent, 
splitting the tree. If v is the root of the tree, 
nothing happens. 

condense(u) 

G(logn) 

Deletes vertex v; its children become children 
of the parent of v. If vertex v is the root, the 
number of connected components of the forest 
increases by d — 1, with d being the degree of v. 

erase(u) 

C>(logn) 

Deletes vertex v and all its adjacent edges. 

EVERT(ti) 

0{d\og n)[^ 

Re-roots the tree containing vertex v at vertex v. 


root(u) 

C>(logn) 

SAMe-tree(u, v) 

C>(logn) 

is-descendant(u, v) 

C>(logn) 

PARENT (u) 

C>(logn) 

ancestor(u, k) 

C>(logn) 

lca(u, v) 

C>(logn) 

degree(w) 

C>(logn) 

LIST-CHILDREN (u) 

G(51ognf| 

CHANGE-VAl(-!;, x) 

C’(logn) 

REDUCE-CHILDREN 

^(log 

REDUCE-CHILD-SUBTREES 

©(log n)^ 

COMBINE 

©(log nf 


Returns the root of the tree containing node v. 
Tests if nodes u and v belong to the same tree. 
Answers whether node w is a descendant of v. 
Returns the parent of node v. 

Returns the ancestor of node v at depth dv — k, 
where represents the depth of u, if existent. 
Returns the lowest common ancestor of nodes u 
and V (if they belong to the same tree). 

Returns the degree of node v. 

Returns a list containing the children of vertex v. 


Assigns val(ii) = x. 

See description in the text, Section 
See description in the text, Section 
See description in the text. Section 
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M Table 1 DFT-Tree operations on an n vertex forest. The complexity values reported are 
amortized complexity if we implement the DFT-Tree with Splay Trees [25] and worst-case complexity 
if we use Red-Black Trees |H]. 


Preliminaries 

We assume the reader is familiar with basic concepts of graph theory (see, e.g., i)- We recall 
that, in an undirected graph G, a connected component is a maximal set of vertices V GV 
such that, given u,v £ V, there is at least one path between u and v in G; an articulation 
point is a vertex v € V such that its removal from the graph G increases the number of 
connected components of G; similarly a bridge is an edge e € E such that its removal from 
the graph G increases the number of connected components of G. A biconnected component 
is a maximal set of vertices V" C V such that after the removal of any v G V", the remaining 
graph V"/v is connected. Following |B], the impact of an articulation point is the number 


^ Where d is the depth of the node involved. We note that the evert operation is slow in the worst case, 
but it is possible to amortize it by always everting the smallest tree. 

^ Where 5 is the degree of the node passed as argument to DEGREE. 

^ Assuming that the operations (denoted with 0 and 0) in REDUCE-children, reduCE-CHILD-SUBTREES 
and COMBINE take constant time when called with two nodes. 
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M Figure 2 Effects of the link, cut and condense operations. 


of vertices that get disconnected from the largest connected component when v is removed 
from the graph. 

There are several measures of centrality of vertices in a network. In this work we refer to 
the betweenness centrality and closeness centrality. The betweenness centrality, originally 
defined in [Tl], is defined as follows: hc{u) = where crst{u) is the number of 

shortest paths between s and t that pass through u, and ast is the total number of shortest 
paths. The closeness centrality, proposed by Bavelas in 1950 |7j, is the reciprocal of the 
farness of a vertex, where the farness is the sum of all the distances to the other vertices in 
the graph. 

[~yi Depth First Tour Trees 

In this section we describe the main idea of the DFT-Trees, which builds up on the Depth 
First Visit of the tree and its linearization into an array; for the sake of the exposition we 
will populate this array with (opening and closing) parentheses that will be denoted as the 
parenthetical sequence of the tree. The other key ingredient of the DFT-Trees is a summary 
defined over the parenthetical sequence: in the underlying BEST the node corresponding to 
vertex v is augmented with both the information about v and the summary of its subtree (in 
the BEST). The depth first visit of a tree is constructed by recursively visiting nodes in a 
depth-first fashion. When a node is entered for the first time, it is appended to the back of 
depth first tour, along with a tag indicating it was a newly-opened node (called an open-node); 
when all its children have been visited, we push back the node again before returning the 
call, this time with tag indicating this is a fully explored node (called a close-node). Since 
every node is appended to the list exactly twice, the size of the depth first tour of a tree of 
size n is 2n. 

Figure shows the depth first tour of an example tree of size 10, together with its 
linearization: an array that contains its parenthetical sequence; the Euler Tour of the same 
tree is shown for comparison: note that in an Euler Tour a node can appear several time; 
the size of an Euler Tour is 1 -I- 2m = 2n — 1, since an Euler Tour begins with a node and 
then, for each edge of the tree, both its endpoints are added exactly once, when entering the 
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4 

3- 

Depth: ^ 

0 

-1 


Sequence: ( | ( | ( | ) | ( | ) | ( | ( | ) | ) | ) | ( | ) | ) | ) | ( | ( | ) | ( | ( 

M Figure 3 Depth of a sequence of parentheses. In this case, the summary of the sequence is the 
pair (—1,3). The summary of the first four parentheses is (0,2). 



node. In Figure we can see the effects of the link, CUT and CONDENSE operations on the 
tree and the corresponding parenthetical sequence. 

► Definition 1 (depth of a parenthesis). We define the depth of a parenthesis in a sequence 
of parentheses as the difference between the number of open parentheses and the number of 
closed parentheses in the prefix of the given sequence ending in that parenthesis. 

The sequence of the depths of the parentheses coincides with the prefix sums of the sequence 
obtained by replacing every open parenthesis with a 1 and every closed parenthesis with a 
- 1 . 

► Definition 2 (summary of a sequence of parentheses). We define the summary of a sequence 
of parentheses as the pair of integers {a,b), where a is the minimum between 0 and the 
minimum depth of the parentheses of the sequence, and b is equal to the difference between 
the depth of the last parenthesis and a. 

In the following, we refer to the first value of the summary as to the down-value, and to the 
second as to the np-value. Note that the down-value of a summary is always non-positive, 
while the up-value is always non-negative. In Figure]^ we show a graphical representation 
of the depth of the parentheses in the sequence: for example, the summary of the whole 
sequence is the pair (— 1 , 3 ), whilst the summary of the first four parentheses is ( 0 , 2 ). It 
should be clear that the summary of the sequence made of just one open parenthesis is (0,1), 
while the summary of the sequence made of just one closed parenthesis is (—1,0). 

The following lemmas hold for any sequence of parentheses: 

► Lemma 3. The down-values of the prefixes, taken in order, of any sequence of parentheses 
form a monotonically decreasing sequence of integers. 


► Lemma 4. A sequence of parentheses is balanced if, and only if, its summary is equal to 
( 0 , 0 ). Any prefix of a balanced parenthetical sequence has down-value 0 . 


► Lemma 5. Let 81,82 he two sequences of parenthesis having summary (ai,6i) and (02,^2) 
respectively. The summary of the sequence 81 -\- 82 obtained by concatenating 81 and 82 is 
the pair (ai,6i) ffl (02,^2), where the sum between summaries is defined as: 


(ai, 6 i) ffl (02,^2) 


(fll, 61 -f 02 -f &2) if bl -\-02 >0 
(ai -1-61-1-02, 62) otherwise. 


► Lemma 6. The sum of two summaries defined above is an associative operation. 

As a consequence of Lemma as we mentioned before, we can store in each vertex of 
the BEST the sum of the summaries of all the vertices in its subtree. We proceed with the 
following lemma: 
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M Figure 4 Characterization of the parent of node v, as stated in Lemma The values under the 
small dots represent the down-values of the prefixes. 


► Lemma 7. Let close-v be the close-node associated with the non-root node v. The close- 
node associated with the parent of v is the first (leftmost) node u after close-v reaching 
depth —1 relative to close-v. 

Lemma together with the associativity of ffl and the monotonicity of the down values 
of the prefixes of any (sub)sequence of parentheses (Lemma |^, gives us an efficient way to 
locate the parent of any non-root node: we simply binary search the smallest prefix having a 
negative down-value, inside the suffix of the parenthetical sequence starting after close-v. 
Refer to figure]^ for a visual insight. Similar properties hold for lea and ancestor: for example, 
for the A:-th ancestor we can (binary) search the first node reaching relative depth —k with 
respect to close-v, after close-v. 

[~^ Subtree (and path) operations 

In this section we detail the subtree and path operations. As we mentioned before, we assume 
that each node v has an associated value (note that values can be generic objects, not only 
numbers), denoted by val(u). We have the following three generic operations on a node that 
operate, respectively, on its children, on its subtree, and on the path from the node to the 
root: 

H reduce-children('(;, 0): Computes the value of 

val(ci) 0 • • • © val(cd), 

where Ci,...,Cd are the children of node v, and 0 is an associative operation (not 
necessarily invertible). 

H REDUCE-CHiLD-SUBTREES(u, 0,0): Computes the value of 

S(ci) 0 S(c 2 ) 0 S(c 3 ) 0 • • • 0 S(cd) 

where ci,..., Cd are the children of node v, 0 and 0 are associative operations (not 
necessarily invertible), and S(a::) = val(a::i) 0 • • • 0 val(a::m) is some information about the 
subtree rooted at x and containing nodes xi,, Xm- 
H COMBiNE(ri, ©): Computes the value of 


val(z;i) © • • • © val(tim), 

where v = vi,V 2 ,..., Vm are the nodes in the path from v to the root of the tree, and © 
is an associative and invertible operation. 
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M Figure 5 Visual insight for Lemma The numbers written in the nodes of the tree on the left 
represent the values assigned to the vertices. 


Differently from all other arguments, the operations denoted with 0 , 0 and © used in the 
three operations above have to be known in advance, so that the DFT-Tree knows what 
partial evaluations it should memoize in the nodes. 

Among the three operations, COMBINE is the most straightforward, implementation-wise. 
The idea is to assign a value to both the open-nodes and close-nodes of the DFT-Tree: we 
assign the value of the vertex val(n) to the open-node of v, and the opposite value — val(n), 
i.e. the inverse of val(u) with respect to operation ©, to the corresponding close-node. We 
can thus state the following lemma, depicted in Figure for the case © is the traditional 
sum operator 

► Lemma 8. Let open-v be the open node associated with the tree node v. The value of 
COMBiNE(n, ©) is equal to the Q-combination of the values of the nodes in the prefix of the 
DFT-TREEendOTg in open-v. 

In order to implement reduce-CHILDREN and reduce-CHILD-SUBTREE, we need to 
extend the summary of a sequence of parentheses. 

Let us note that it is possible to uniquely decompose any sequence of parentheses in three 
contiguous (possibly empty) pieces, namely a prefix, a body and a suffix. If the down-value of 
the sequence is (strictly) negative, then the prefix ends in leftmost minimal-depth parenthesis 
of the sequence, and the body ends in the rightmost minimal-depth parenthesis. If, on the 
contrary, the down-value of the sequence is 0 , we can distinguish two separate cases: if the 
up-value is 0 , then both the prefix and the suffix are empty, and the body coincides with 
the whole sequence; else, both the prefix and the body are empty, and the suffix coincides 
with the whole sequence. In any case, notice that the body of a sequence is a balanced 
subsequence, made of zero or more subtrees. As an example, consider these five sequences: 

H )()((): the prefix is ), the body is 0 and the suffix is (() 

H ) ()): the prefix is ) ()), both body and suffix are empty 

H )) (: the prefix is )), the body is empty and the suffix is ( 

■ ((): both the prefix and the body are empty, and the suffix is (() 

■ ( 00 ): both the prefix and the suffix are empty, while the body is ( 00 ) 

We use this property, i.e. the unique decomposition of a sequence of parentheses, in the 
two summaries, used respectively by REDUCE-CHILDREN and reduce-CHILD-SUBTREE to 
incrementally aggregate information about subtrees. Below we report the simpler one, used 
in REDUCE-CHILDREN: 

► Definition 9 (rc-summary). An rc-summary of a sequence of parentheses is a tuple having 
these fields: 
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H prefix-depth, the depth of the minimal-depth parenthesis 

H body-combination, the 0-combination of the values of the nodes associated with the 
subtrees of the body of the sequence. 

H suffix-depth, the difference between the depth of the last parenthesis and the depth of 
any minimal-depth parenthesis. 

H suffix-info, the value associated with the first node of the suffix, if any. 

The similar rcs-summary, used in reduce-CHILD-SUBTREE, is reported in the Appendix. 
These two summaries, to be stored as usual in the nodes of the underlying BEST, and the 
three generic functions above can be used to implement several functions, and below we 
report few examples. 

Functions implemented using reduce-children. We can use reduce-children to im¬ 
plement : 

H CHILDREN-SUM(u): Finds the sum of the values of the children of node u. This is equivalent 
to reduce-children(u,+). 

H CHiLDREN-MAx(u): Finds the maximal value among those of the children of node v. This 
is equivalent to reduce-CHILDREN(u, max). 

Note that, if we set val(a;) = 1 for every vertex in the forest, degree(u) can be derived as 
well from reduce-children(u, +). 

Functions implemented using reduce-child-subtrees. In the case of reduce-child- 
SUBTREES we can implement: 

H SUBTREE-SUM(w): Finds the sum of the values of the nodes in the subtree of node u, and 
is equivalent to val(z;) + reduce-child-subtrees(w, +, +). 

H SUBTREE-SIZe(u): Finds how many nodes are there in the subtree of node v, and is 
equivalent to SUBTREE-SUM(z;) when val(a:) = 1 for every node x of the forest. 

H SUBTREE-MAx(u): Finds the maximal value among those of the nodes in the subtree of 
node V, and is equivalent to max(val(z;), REDUCE-CHILD-SUBTREES(u, max, max). 

H MAXSUM-CHILd(u): Finds the maximal value of SUBTREE-SUM among the children of 
node V. This is equivalent to REDUCE-CHILD-SUBTREES(w,0, max)). 

Functions implemented using combine. A simple example of combine is depth(z;), which 
finds the depth of node u, i.e. the distance from v to the root of the tree v belongs to. Indeed, 
this is equivalent to COMBINe(u, +), assuming val(a;) = 1 for every node x of the forest. We 
can implement distance(u, u), i.e. the distance in the tree between u and v, by computing 
DEPTH(u) + DEPTH(u) — 2 • DEPTH(LCA(lt, u)). 

If we want to compute the distances in a weighted tree (i.e., we have weights on the 
edges), the same idea holds; since we store the information in the nodes, we store the weight 
of an edge connecting a child node to the parent node inside the child node. 

[~5l Applications 

In this section we show, in order to provide a few examples, how to use DFT-Trees to 
solve several problems that can be modeled as subtree problems. In particular, in all the 
applications that we describe we will refer to a common scenario: we ask queries about a 
single node v, and the queries can be answered by looking at the subtrees of u, i.e. the 
subtrees rooted in the children of v, together with the part of the tree that is above v, that 
we will denote by T„: this is the part of the tree that we reach through the parent of v. We 
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M Figure 6 A graph (left) and its Block Forest |31| (right). 


will describe the applications in increasing order of complexity, from the perspective of the 
DFT-Trees: indeed, as we will see, to compute the impact of an articulation point v we 
need to compute the size of the subtrees of u, and of T„; for the betweenness centrality we 
also need to evaluate the sum of the squared sizes of the subtrees of v, and, finally, for the 
closeness centrality we need the the sum of all the distances from v to every node, both in 
its subtree and above it. 

5.1 Biconnectivity properties and impact of articulation points 

The DFT-Tree can be used to maintain all the (bi)connectivity properties of a streaming 
graph, following the same approach proposed by Westbrook and Tarjan m- as we mentioned 
before, it is sufficient to observe that the DFT-Tree supports all the operations needed by 
the algorithm of Westbrook and Tarjan to maintain the Block Forest (shown in Figure]^, 
including condense that, as we mentioned before, is not a standard operation in the case 
of the dynamic tree problem. Indeed, it is possible to maintain connected and biconnected 
components, and bridges and articulation points of a streaming graph. 

We now show how to answer queries on the impact of an articulation point. We recall, 
from [ 5 ] , that the impact of an articulation point v is the number of nodes that get disconnected 
from the main connected component when v is removed from the graph. Looking at the the 
Block Forest, Figure]^ (right), it is easy to see that the articulation points are exactly the 
square nodes that connect two or more round nodes (the biconnected components). When 
an articulation point is removed, its Block Tree splits into pieces: in order to compute the 
impact, we need to know the size of each of them: the impact is, by definition, the sum of all 
the size of the trees except the largest one (the main connected component). If we refer the 
subtree operations seen in the previous section, we can use the DFT-Tree in the following 
way: 

H The value in each round node in the tree is 0 (they corresponds to biconnected components), 
and 1 in each square node (corresponding to real nodes in the graph). 

H The size of the Block Tree can be computed by finding the root of the tree, using ROOT 
and then computing its subtree-size. 

H The size of the maximum subtree of v can be computed using maxsum-CHILD. 

It is easy to see that, with the operations described above, we can compute the impact of a 
node, and thus we can state the following result. 

► Lemma 10. Using a DFT-Tree, it is possible to answer impact queries of a vertex in 
time C>(logn). 
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5.2 Betweenness centrality 

The betweenness centrality definition involves shortest paths, but, since in a tree there is 
exactly one path between each pair of nodes, the goal here is, given a vertex w, to count all 
the paths that pass through it. We can do this using DFT-Trees in the following way. Let 
us assume that vertex v has k children, each of them with a corresponding subtree (eventually 
made by one node only, i.e. the child is a leaf). Let us denote with sti,st2 ■ ■. stk the subtrees 
of V. The number of (shortest) paths through v can be partitioned into two components: 
i) the paths between the subtrees of v and the rest of the tree, i.e. {a} U Ty, and ii) the 
paths between all the possible pairs of subtrees of v. The first component can be computed 
easily, using the fact that |T„| = SUBTREE-SIZe(root(u)) — SUBTREE-SiZE(a). The second 
component is the sum of the products of all the possible pairs of sizes, i.e., 
its computation is more tricky, if we want to avoid the iteration for every subtree. The idea 
is the following: 

H The value of each node in the tree is the pair ( 1 , 1 ). 

H We define (a, a^) 0 {b, 6^) to be (a + b,{a + 5 )^). 

H We have, as an invariant, that the values computed by © are a couple made by a number 
and its square, e.g., {x,x‘^). Note that this defines an associative operation. 

H We define (a, a^) 0 (&, 6^) to be (a + b,a^ + 6^) (i.e., the usual vector sum). 

Now, if we call REDUCE-CHiLD-SUBTREES(a, ©, 0) we obtain, for v, the couple made by the 
sum of the sizes of its subtrees, and by the sum of the squares of the sizes of its subtrees: 

(|sti| 0 |st2| 0 ■ ■ ■ 0 \stk\, |stip 0 |st2p 0 ■ • ■ 0 Istfep) = (X) II) I] IP)• ^ is easy to see, 

using the rule of the square of a sum, that the needed second component can be obtained by 
the couple of values. This allow us to state the following Lemma. 

► Lemma 11. Using a DFT-Tree, it is possible to answer betweenness centrality queries 
of a vertex in time 0(logn). 

5.3 Closeness centrality 

The closeness centrality [ 7 ] of a vertex is defined as the reciprocal of its farness, the sum of 
the distances to all the other vertices. We now show how to maintain the farness of each 
vertex, using the DFT-Trees. The main ingredients are: 

H We modify the DFT-Trees in order to support the two following operations: ADD- 
TO-PATH(a,5) that adds 6 to all the vertices in the path between v and the root, and 
ADD-TO-SUBTREE(n, ( 5 ) that adds 5 to all the vertices in the subtree of v. Note that we 
can implement both these operations in 0 (logn) per update and value query, without 
affecting the complexity of the structural operations. 

H each vertex stores two values, UP-dists that is the sum of the distances to the vertices in 
Ty, and down-dists that is the sum of the distances to the vertices in its subtree. Note 
that the farness of a vertex is the sum of this two values. 

Now, just to provide an example: assume that we are doing a link operation, adding the 
edge between u and v, whose weight is w. Let us denote the size of the tree u (resp. v) 
belongs to with s„ (resp. s^). The following operations need to be executed before the actual 
linking to maintain the information: 

H the DOWN-DISTS of all the nodes in the path of u are increased by w ■ SUBTREE-SIZe(z;) 0 
DOWN-DISTS(a); 

H the UP-DISTS of all the nodes in the subtree of v (included) are increased by w ■ 
SUBTREE-SIZe(rOOt(u)) 0 UP-DISTS(u) 0 DOWN-DISTS(a); 
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H the UP-DISTS of all the nodes in the tree containing rt, with the only exception of the 
nodes in the path of u, are increased by w ■ SUBTREE-SIZe(z;) + down-dists('(;). In order 
to do so, we add it to all the nodes (i.e. the subtree of ROOt(m)), and then we subtract it 
from all the nodes in the path of u. 

The other structural update operations are similar, and can be derived in a similar fashion 
(we report them in the Appendix). This allow us to state the following Lemma. 

► Lemma 12. Using a DFT-Tree, it is possible to answer closeness centrality queries of a 
vertex in time 0{logn). 


6 


Conclusion and future works 


In this paper we presented a novel data structure, the Depth First Tour Tree. This structure 
is based on a linearization of a DFS visit of the tree, similarly to the ET-Trees (based on a 
Euler Tour). 

The structure is simple and easy to implement; it provides a framework for a large class 
of data aggregation tasks - especially on subtrees, a task that is usually non-trivial with 
other data structures. Furthermore, DFT-Trees can naturally aggregate on all the children 
subtrees of a node v in parallel without having to pay a cost proportional to the degree of v 
itself: as we already mentioned, given a node v it takes 0 (logn), independently from the 
degree of v, to answer the child of v whose subtree is the largest. 

This flexibility, related to subtree queries, is paid by the evert operation, that requires a 
cost proportional to the depth of the node being everted. However, as discussed, when either 
the number of eversions is small compared to the total number of queries performed, or the 
costs of the eversion is amortized, the cost of evert can be regarded as being 0 {logn) like 
all the other structural operations. 

We showed that this is the case in all the applications presented in the previous section. We 
described how to customize the data structure in order to answer queries for three different 
applications: impact of the removal of an articulation point from a graph, betweenness 
centrality and closeness centrality of a dynamic tree. 

In the future, we plan to experimentally assess the performance of our data structure, 
and compare it with the existing alternatives, following the approach of [^. We believe that 
the simplicity of our approach, when compared e.g. to the work of [23j in the context of the 
SUBTREE-MAX Operation, is likely to deliver faster and more readable code in practice. 
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H Implementation of DFT-Trees using Splay Trees 

In this appendix we detail the pseudo-code for all the supported operations in a DFT-Tree, 
using the Splay Trees m, that are used by Tarjan and and Tarjan m to implement both 
the ST-trees [211 [2S], ET-trees [17112H] . 

The DFT-Tree is thus stored as an augmented splay tree, where the comparison x ^ y 
between two entries x and y of the depth first tour evaluates to true iff entry x comes before 
entry y in normal left-to-right order. 

Since the focus of the paper has been devoted to subtree computations, we note here that 
in Appendix |A.5| we show an example of how to use combine to compute a path operation. 

A.l Basic splay operations 

We will take for granted the implementation of these basic operations on the splay tree, besides 
the tree rotations, SPLAY, splay-erase, splay-min / splay-max and splay-predecessor 
/ splay-successor: 

splay-root(u): Returns the root node of the splay tree containing node v. 
splay-lca(M, v)\ Returns the lowest common ancestor of the splay nodes u and v. Of course, 
u and V must belong to the same splay tree (i.e. SPLAY-root(u) = SPLAY-root(w)). 
splay-merge(M, v) : Joins the splay tree Ti containing node u with the splay tree T 2 containing 
node V. If u and v belong to the same tree, nothing happens. If u and v belong to 
different tree, the keys contained in Ti are considered to precede all the keys in T 2 . 
splay-split(D): Splits the splay tree T containing v into two different splay trees: the first 
contains all the keys which are -< v, and the second contains all the keys which are v. 
splay-precedes(M, v) : Returns true if u ^ u, false otherwise. 

Operation SPLAY-root can be implemented by simply moving from a node to its parent 
until we eventually reach the root of the splay tree. This method clearly results in amortized 
logarithmic complexity with respect to the tree size. 

SPLAY-LCA can be implemented by marking all the nodes in the path from v to the root, 
and then moving up the tree starting from u, stopping at the first marked node found on 
this path, which corresponds to the sought ancestor. 

Also, it is possible (see m) to support splay-merge and SPLAY-SPLIT in logarithmic 
time in the size of the trees involved. 

Implementation for precedes is given in Algorithm [l] 


Algorithm 1 Implementation of splay-precedes 

1 

procedure splay-precedes(m, v) 

> u and V are dft nodes. 

2 

successor -e- SPLAY-SUCCESSOR(u) 


3 

splay-split(u) 


4 

answer (spAY-ROOt(u) == SPLAY-ROOt(ii)) 


5 

if successor 7 ^ NULL then 

> Restore tree. 

6 

SPLAY-MERGe(m, successor) 


7 

end if 


8 

return answer 


9 

end procedure 



We will assume that every splay node contains a pointer to its twin, i.e. to the other dft 
node associated to the same tree node. 

In general, we will maintain a collection of disjoint splay trees, where in turn a splay tree 
can maintain the depth first tours of one or more (disjoint) trees. When a splay tree contains 








16 


Dynamic subtree queries revisited: the Depth First Tour Tree 


only one dft, we say that the dft has a dedicated splay tree. We provide an internal operation, 
SPLiCE(r;), which makes sure that the dft of the tree containing v gets a dedicated splay 
tree. Notice that splice alters the internal splay tree representation, without affecting the 
represented tree. Assuming that we already have implemented operation ROOT, implementing 
SPLICE in logarithmic time is rather straightforward and is done in Algorithm]^ 


Algorithm 2 Implementation of splice 

1: procedure SPLICe(i!) 

2: open-root open-node of ROOt(ii) 

3: close-root close-node of root(w) 

4: predecessor C— SPLAY-PREDECESSOR(open-root) 

5: if predecessor A NULL then 

6 : SPLAY-SPLIT(predecessor) 

7: end if 

8 : SPLAY-SPLIT(close-root) 

9: end procedure 


A.2 Import/export operations 

Building the DFT-Tree of a given tree, encoded in the adjacency list format, is a very 
simple task, and can be seen as an easy modification of the classical dfs algorithm. 

The opposite task, i.e., restoring the original tree given its depth first tour, is also very 
simple. Indeed, it is enough to keep track of the current open node using a stack, while we 
process every node in the given DFT-Tree: see Algorithm 


Algorithm 3 Depth first tour to tree conversion 

1: procedure dft-to-tree(DFT) [> DFT is a list here 

2 : s empty stack 

3: for all (node, tag) in DFT in order, do 

4: if tag is an open-tag then 

5: if s emptyO then 

6: mark node as the root of the tree 

7: else 

8: add node to the children of s.top(). 

9: end if 

10: s.push(node) 

11 : else 

12: s.popO 

13: end if 

14: end for 

15: end procedure 


To perform import-tree we first construct the depth first tour of the input tree, and 
then build a splay tree corresponding to it. Since the order of the nodes in the depth first 
tour coincides with the order maintained by the underlying splay tree, we can perform a 
linear time tree construction as described in [...]. To correctly maintain the extra information 
stored in the nodes of the splay tree, we can propagate them from the leaves up to the root, 
combining them using the RECALC-extra-INFO function, leading to an overhead which is 
linear in the size of tree, hence not affecting the total complexity of the operation. 

Operation export-tree performs an in-order traversal of the (spliced) splay tree, 
extracting a list version of the depth first tour it represents, and then runs DFT-to-tree on it. 
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Since both operations have linear complexity in the tree size, we can support export-tree 
in linear time. 


A.3 Structural updates 

In this section we describe the implementation of the structural update operations on a 
DFT-Tree. In particular, the most important operations are the link and CUT, whose 
effect on the parenthetical sequence is shown in Figure]^ 

Suppose an edge is created between the root v of tree T 2 and node u of tree Ti. From 
the point of view of depth first tours, what happens is that the dft of T 2 is inserted into the 
dft of Ti right after the open-node corresponding to u. See Algorithm]^ below. 


Algorithm 4 Implementation of link 

I: procedure link(u, v) 

2: if not same-tree(«, v) then 

3: open-u open-node of node u in the dft 

4: close-u close-node of node u in the dft 

5: open-v c— open-node of node v in the dft 

6: SPLAY-SPLIT(open-u) 

7: SPLAY-MERGE(open-u, open-v) 

8: SPLAY-MERGE(open-u, close-u) 

9: end if 

10: end procedure 


Operation CUt(u) is analogous and has the effect of extracting the sub-segment of the 
dft corresponding to the subtree rooted in v, as illustrated in Figure Its implementation is 
symmetric to the one of link: 


Algorithm 5 Implementation of cut 

I: procedure CUt(ii) 

2: root ROOt(-!;) 

3: if a 7 ^ root then 

4: open-v ■(— open-node of node v in the dft 

5: close-v close-node of node v in the dft 

6 : open-root u- open-node of root in the dft 

7: close-root ^ close-node of root in the dft 

8: SPLAY-SPLIT(SPLAY-PREDECESSOR(open-v)) 

9: SPLAY-SPLIT(close-v) 

10: SPLAY-MERGE(open-root, close-root) 

II: end if 

12: end procedure 


Note that the call to predecessor in line 8 is licit: since v is not the root of the tree, 
open-v cannot be the first node in the dft. 

The effect of operation CONDENSE(a) on the dft of the tree is explored in Figure]^ and 
corresponds to the deletion of the open- and close-node associated with v in the dft. 
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M Figure 7 Effects of the condense operation on the dft. 


Algorithm 6 Implementation of condense 

1: procedure CONDENSe(w) 

2 : open-v open-node of node v in the dft 

3: close-v c— close-node of node v in the dft 

4: SPLAY-ERASE(open-v) 

5: SPLAY-ERASE(close-v) 

6 : end procedure 


Operation erase(p) is equivalent to a call to CUt(u) followed by a call CONDENSe(r). 


Algorithm 7 Implementation of erase 

1: procedure erase(i;) 

2: CUT(ti) 

3: condense(-!;) 

4: end procedure 


Notice that both erase and CONDENSE may lead to dft having non-dedicated splay trees. 

Operation evert(r) can be implemented in two different ways. The first one makes a call 
to EXPORT-TREE, Operates an 0(n) evert operation on the adjacency list version of the tree 
and finally rebuilds the splay version using import-tree, for a total of 0 {n) operations on a 
tree of size n. The second way of performing the eversion consists in the following recursive 
algorithm, whose complexity is 0(/ilogn), where h is the depth of node v. 


Algorithm 8 Implementation of evert 

1: procedure evert(«) 

2: root ROOt(w) 

3: if II 7 ! root then 

4: parent ^ PARENT(ti) 

5: CUT(t) 

6: EVERT(parent) 

7: LINK(ti, parent) 

8 : end if 

9: end procedure 
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A.4 Non-structural operations 

Operation SAME-tree(m, v) is straightforward and corresponds to checking whether root(m) = 
ROOt(r) or not. 

To implement is-descendant we first make the following observation: 

► Lemma 13. 

Let u and v be two nodes, having open-nodes open-u, open-v and close-nodes close-u, 
close-v respectively. Node u is a descendant of node v if and only if open-v ^ open-u and 
close-u ^ close-v. 

Using the previous observation, implementing is-descendant becomes a straightforward 
task, shown in Algorithm 


Algorithm 9 Implementation of is-descendant 

1: procedure is-descendant(u, a) 

2 : open-u ^ open-node of node u in the dft 

3: close-u close-node of node u in the dft 

4: open-v c- open-node of node v in the dft 

5: close-v close-node of node v in the dft 

6: return SPLAY-PRECEDES(open-v, open-u) A SPLAY-PRECEDES(close-u, close-v) 

7: end procedure 


Operation list-children repeatedly uses operation SPLAY-SUCCESSOR to traverse con¬ 
secutive siblings, shown in Algorithm [T0| 


Algorithm 10 Implementation of list-children 

1: procedure list-children(w) 

2 : open-v c- open-node of node v in the dft 

3: close-v close-node of node v in the dft 

4: current SPLAY-SUCCESSOR(open-v) 

5: children empty list 

6: while current A close-v do 

7: children. push(tree node associated to current) 

8: current SPLAY-SUCCESSOR(current. twin) 

9: end while 

10: return children 

11: end procedure 

Note: we recall that the twin of a dft node u is the (pointer to) the other dft node u' associated to 
the same tree node as u. In this case, line 8 finds the next sibling of the tree node associate with 
current. 


Operation parent, briefly described in Section]^ is the first non-trivial operation, as we 
begin to exploit the parenthetical sequence of dft and to work on the augmented splay tree 
nodes. As such, we first need to set some definitions about sequences of parentheses. 

► Lemma 14. The suffix of the dft of the whole tree starting after close-v begins with the 
concatenation of the dft of zero or more siblings of node v, followed by the close-node of the 
parent of v. 

We provide a visual insight in Figure]^ 
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M Figure 8 Characterization of the close-node of the parent of v. The small nnmbers nnder the 
dots represent the snmmary down-valnes. 


Given the monotonicity of the summary down-values noted above, we can devise a binary 
search algorithm for finding the parent of any non-root node, shown in Algorithm E] 


Algorithm 11 Implementation of parent 

1: procedure RECURSlVE-PARENT(splay-node, summary) t> pre: the down-value of summary is 0 
2: if splay-node . left-child yf NULL then 

3: if (summary EB splay-node . left-child.range-summary).down-value < —1 then 

4: return RECURSIVE-PARENT(splay-node . left-child, summary) 

5: else 

6: summary C— summary EB splay-node . left-child .range-summary 

7: end if 

8 : end if 

9: if (summary EB splay-node.node-summary).down-value < —1 then 

10 : return splay-node 

11 : else 

12: summary summary EB splay-node .node-summary 

13: end if 

14: return RECURSIVE-PARENT(splay-node. right-child, summary) 

15: end procedure 
16: 

17: procedure parent(i;) 

18: root ^ ROOT(n) 

19: if n yf root then 

20: close-v c- close-node of v 

21: successor C— SPLAY-SUCCESSOR(close-v) 

22: SPLAY-SPLIT(close-v) 

23: close-parent ^ RECURSIVE-PARENT(SPLAY-ROOT(successor), (0,0)) 

24: SPLAY-MERGE(close-v, successor) 

25: return the tree node having close-parent as close-node 

26: else 

27: return null 

28: end if 

29: end procedure 


Operation LCA can be supported in a similar fashion, since the following result holds: 

► Lemma 15 (characterization of the lea). Let u,v be distinct nodes belonging to the same 
tree, for which none is a descendant of the other, and let close-u and close-v be their 
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M Figure 9 Characterization of the lea of nodes u and w. The lea is the parent of w, the node 
associated with any parenthesis having minimal depth. 


close-nodes in the dft. Suppose further, without loss of generality, that close-u ^ close-v. 
Consider the subsequence of the parenthetical sequence ofT, starting in close-u and ending 
in close-v, and let w be the leftmost dft-node having minimal depth. The lowest common 
ancestor of u and v is the parent a of the tree node corresponding to w. More specifically, w 
is child of a closest to node u, i.e. the seeond-to-last node in the path from u to a. 

See Figure]^ for a visual insight. To quickly determine w we augment the concept of 
summary, so that it keeps track of some parenthesis reaching minimal depth. More formally, 
we consider the following definition: 

► Definition 16. (lea-summary of a sequence of parenthesis) The lea-summary of 
a sequence of parentheses is an ordered pair (s,p), where s is the summary of the given 
sequence and p is a (pointer) reference to the leftmost parenthesis having depth equal to the 
down-value of s. If no such parenthesis exists, p is set to null. 


It is easy to adapt the addition operator between summaries to lea-summaries, so that 
we can easily evaluate the lea-summary of the concatenation of two sequences, as can be 
seen in Theorem El 


► Lemma 17. Let Si, S 2 be two sequences of parenthesis having lea-summary (si = (oi, 61), Pi) 
and (s 2 = ( 02 , b 2 ),P 2 ) respectively. The summary of the sequence Si -\- S 2 obtained by con¬ 
catenating Si and S 2 is the pair (si,Pi) ffl (s2,T2); where the sum between lea-summaries is 
defined as: 


(Sl,pi) ffl (S2,P2) 


(siffls2,Pi) ifbi-\-a 2>0 

(sifflsi,p 2 ) otherwise. 


► Lemma 18. Let {s,p) be the lea-summary of a sequence of parentheses. Pointer p points 
to NULL if and only if all the depths of the parentheses are (strictly) positive. 


► Lemma 19. The sum of two lea-summaries defined above is an associative operation. 


Note that by Proposition 18 it follows that the lea-summary associated with the range 
indicated in Theorem has a non-NULL reference, since the first parenthesis of the range is 
a closed-parenthesis. 

As before, we augment the nodes of the splay tree so that every node keeps the extra 
values 
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H node-lca-summary, corresponding to the lea-summary of the node in question; 

H range-lca-summaxy, the lea-summary of the subsequence associated with the splay 
subtree rooted in the node in question. 

We sketch the algorithm for determining the lea of two nodes in Algorithm [T^ 


Algorithm 12 Implementation of lca 

1 : procedure lca(m, v) 

2 : if is-descendant(m, v) then 

3: return v 

4: elseif is-descendant(i!, u) then 

5: return u 

6: end if 

7: close-u ^ close-node of node u in the dft 

8 : close-v c- close-node of node v in the dft 

9: if SPLAY-PRECEDES(close-v,close-u) then 

10 : return lca(u, u) 

11: else 

12: pred-u SPLAY-PREDECESSOR(close-u) 

13: succ-v SPLAY-SUCCESSOR(close-v) 

14: SPLAY-SPLIT(pred-u) 

15: SPLAY-SPLIT(close-v) 

16: range-root -f- SPLAY-ROOT(close-v) 

17: dft-w ■(— range-root.range-lca-summary.p 

18: SPLAY-MERGE(pred-u, close-u) 

19: SPLAY-MERGE(close-u, succ-v) 

20 : ui the tree node associated with dft-w 

21 : return parent(w) 

22: end if 

23: end procedure 

Note: note that, since lines 12-19 run only if u is not a descendant of v and v is not a descendant 
of It, prec-u and succ-v are non-NULL, well-defined nodes. 


We conclude this section discussing how to implement ROOT. One may be tempted to say 
that ROOt(r) is the node associated with the SPLAY-min of the splay tree containing the dft 
nodes corresponding to v. Unfortunately, this is not true when the dft of the tree containing 
V is kept in a non-dedicated splay tree. Thus we need the following in Theorem |20[ 


► Lemma 20 (characterization of the root). Let v be a node, and let close-v be the close-node 
associated with v. The close-node of the root of the tree containing v is the leftmost dft-node 
^ close-v having minimal depth. 


In other words, Theorem states that the p value of the lea-summary of the suffix of 
the splay tree starting in close-v is the close-node of the root of v. As before, Theorem [T8| 
guarantees that the p value of that range is not null, as the first node in the range is a 
closed parenthesis. This leads to an easy implementation, shown in Algorithm [T^ 
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Algorithm 13 Implementation of root 


1: procedure root(i!) 

2: close-v close-node of node v in the dft 

3: predecessor ^ SPLAY-PREDECESSOR(close-v) 

4: SPLAY-SPLIT(predecessor) 

5: splay-root ^ SPLAY-ROOT(close-v) 

6: dft-w range-root.range-lca-summary.p 

7: SPLAY-MERGE(predecessor, close-v) 

8: lu the tree node associated with dft-w 

9: return w 

10 : end procedure 


A.5 Reductions and combinations 


We recall that operation combine value of 


val(i;i) © val(t; 2 ) © ■ ■ • © val(n;i), 


where v = vi,V 2 , ■ ■ ■ ,Vh are the nodes in the path from v to the root of the tree, and © is 
any invertible associative binary operation acting on the values attached to the nodes. We 
augment the splay tree, adding two fields: 
item-val, the value of the node, and 

H range-val, the ©-combined value of item-val for all the dft nodes in the splay subtree 
rooted in the node in question 

In particular, if v is a dft node associated with the tree node v, we set 



d(u) if V is a dft open-node 

val(w) if V is a dft close-node 


V. item-val = 


where —x indicates the inverse of x with respect to ©. 

► Lemma 21. Let open-v be the open node associated with the tree node v. The value of 


COMBINe(©,u) is equal to the Q-combination of the item-val of the nodes in the prefix of 


the dft ending in open-v. 

As an example, consider the case in which © denotes the usual addition of real numbers: 
a visual insight for Theorem [2T] is given in Figure [T0| The pseudocode of COMBINE is detailed 
in Algorithm 

Algorithm 14 Implementation of combine 

1: procedure COMBINe(0, d) 

2: open-v open-node of node v in the dft 

3: close-root ^ close-node of the tree root 

4: SPLAY-SPLIT(close-v) 

5: answer ^ SPLAY-ROOT(close-v).range-val 

6: SPLAY-MERGE(close-v, close-root) 

7: return answer 

8 : end procedure 


The rc-summary, defined in Section of the concatenation of sequences Si and S 2 is 
computed by Algorithm [T^ 
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Figure 10 Visual insight for Theorem 
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The numbers written in the nodes of the tree on the 


left represent the values assigned to the vertices. 


Algorithm 15 Implementation of combine-rc-SUMMARies 

1: procedure combine-rc-SUMMARies(s1, s2, ©) 

2 : answer empty rc-summary 

3: if si. suffix-depth + s2.prefix-depth > 0 then 

4: answer.prefix-depthsi.prefix-depth 

5: answer.body-combination si.body-combination 

6 : answer.suffix-depth si.suffix-depth + s2.prefix-depth + s2.suffix-depth 

7: answer. suffix-info si. suffix-inf o 

8 : end if 

9: if si. suffix-depth + s2.prefix-depth < 0 then 

10 : answer.prefix-depth si.prefix-depth + si.suffix-depth + s2.prefix-depth 

11 : answer.body-combination C— s2.body-combination 

12 : answer.suffix-depth s2.suffix-depth 

13: answer.suffix-info s2.suffix-info 

14: end if 

15: if si. suffix-depth + s2. prefix-depth = 0 then 

16: answer.prefix-depth si.prefix-depth 

17: answer.body-combination s2.body-combination © si.body-combination © 

si.suffix-info 

18: answer.suffix-depth s2.suffix-depth 

19: answer.suffix-info C— s2.suffix-info 

20 : end if 

21 : end procedure 


We can augment the splay tree nodes as before, keeping track of the summary combination 
for every range associated with the nodes of the splay tree. The result of COMBINE- 
CHILDREN(w, ©) is equal to to body-combination field of the rc-summary of the range 
starting in the successor of open-v and ending in the predecessor of close-v. 

To support COMBINE-CHILD-SUBTREE we need to extend the definition of rc-summaries 
to keep track of the partial combination in the prefix and the suffix. 

For the sake of completeness we report below the summary used by reduce-CHILD- 
SUBTREES. 

► Definition 22 (rcs-summary). An rcs-summary of a sequence of parentheses is a tuple 
having these fields: 

H prefix-depth, the depth of the minimal-depth parenthesis 

H pref ix-©-inf o, the ©-combination of the values of the nodes associated with the prefix 
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H body-0-inf o, the 0-combination of the values of the nodes associated with the body 

H body-0-info, the 0-combination of the S-values of the subtrees in the body 

H suf f ix-0-inf o, the 0-combination of the values of the nodes associated with the body 

H suffix-depth, the difference between the depth of the last parenthesis and the depth of 
any minimal-depth parenthesis. 

A.6 Applications: betweenness and closeness centrality 

We now need to show how to maintain the information related to up-dists and down-dists 
when we perform the following structural updates: 

H LINK 

H CUT 

_ CONDENSE 

Note that the other structural updates are maintained: evert is implemented using link 
and CUT; erase is implemented using CUT and CONDENSE. 

link. As we mentioned in Section [53| in the case of a link operation, where we add the edge 
between u and v, whose weight is w, the following operations need to be executed before the 
actual linking to maintain the information (we denote the size of the tree u (resp. v) belongs 
to with Su (resp. Sy))' 

H the DOWN-DISTS of all the nodes in the path of u are increased by w ■ SUBTREE-SIZe(i;) 0 
DOWN-DISTS('(;); 

H the UP-DISTS of all the nodes in the subtree of v (included) are increased by w ■ 
SUBTREE-SIZe(rOOt(u)) 0 UP-DISTS(u) 0 DOWN-DISTS('(;); 

H the UP-DISTS of all the nodes in the tree containing u, with the only exception of the 
nodes in the path of u, are increased by w ■ SUBTREE-SIZe(i;) 0 down-dists(p). In order 
to do so, we add it to all the nodes (i.e. the subtree of ROOt(m)), and then we subtract it 
from all the nodes in the path of u. 

cut. The CUT is the dual of the link, thus we execute the following operations after the cut: 

H the DOWN-DISTS of all the nodes in the path of u are decreased by w ■ SUBTREE-SIZe('(;) 0 
DOWN-DISTS('(;); 

H the UP-DISTS of all the nodes in the subtree of v (included) are decreased by w ■ 
SUBTREE-SIZe(rOOt('u)) 0 UP-DISTS(u) 0 DOWN-DISTS(u); 

H the UP-DISTS of all the nodes in the tree containing u, with the only exception of the 
nodes in the path of u, are decreased by w ■ SUBTREE-SIZe(z;) 0 DOWN-DiSTS(ti). In order 
to do so, we subtract if from all the nodes (i.e. the subtree of ROOt(m)), and then we 
add it to all the nodes in the path of u. 

condense When we condense node v, let us denote by u the parent of v and by w the weight 
of the edge {u,v). We execute the following operations before condensing: 

H the DOWN-DISTS of all the nodes in the path of u are decreased by w ■ SUBTREE-SIZe(t); 

H the UP-DISTS of all the nodes in the subtree of v (included) are decreased by w ■ 
(subtree-size(root(i;)) - subtree-size(v)); 

H the UP-DISTS of all the nodes in the tree containing u, with the only exception of the 
nodes in the path of u, are decreased by w ■ SUBTREE-SIZe(u). In order to do so, we 
subtract if from all the nodes (i.e. the subtree of ROOt(m)), and then we add it to all the 
nodes in the path of u. 

We now detail how to maintain a value in the node, such as down-dists and UP-dists, 
under the two following operations: ADD-to-path(i;, S) that adds S to all the vertices in the 
path between v and the root, and ADD-to-SUBTREe(i;, (5) that adds 5 to all the vertices in 
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the subtree of v. In each node we maintain the following information, that will be used to 
derive the value of the node|3 
H A^, to be forwarded in the path of the node; 

H A^, to be forwarded in the subtree of the node; 

H A,, relative to the node. 

In the begininning A^ and A^ are equal to 0, whilst A, has the initial value of the node. 
This allow us to state the following Lemma. 

► Lemma 23. Using a DFT-Tree, it is possible to answer closeness centrality queries of a 
vertex in time 0{logn). 

In the following we report the pseudocode of the affected operations, where we show the 
changes from the previously shown pseudocodes in red (best viewed in color). 


Algorithm 16 Implementation of get-effective-val 

1: procedure get-effective-val(w) > Oilogn) 

2: return A.(t)) + (sum of A^(i;) in the subtree of v) + (sum of Aj_(a) in the path of v) 

3: end procedure 


Algorithm 17 Implementation of ingrement-val - increase the value of node a by <5 

1: procedure INCREMENT-val(v, (5) >0(1) 

2: A. (a)A. (a)-I-(5 

3: end procedure 


Algorithm 18 Implementation of change-val - set the value of node v to target 


1: 

procedure CHANGE-VAL(t), target) 

> Cl(logn) 

2: 

S target — GET-EFFECTIVE-VAL (a) 


3: 

INCREMENT-VAL(ti, 5) 


4: 

end procedure 



Algorithm 19 Implementation of link 

1: procedure link(u, a) 

2: if not same-tree(w, v ) then 

3: A.|.(u) ^ A^(m) - sum of A^ of subtree of v 

4: A 4 .(u) A^(u) - sum of Aj. in the path of u 

5: open-u open-node of node u in the dft 

6: close-u close-node of node u in the dft 

7: open-v c— open-node of node v in the dft 

8: SPLAY-SPLIT(open-u) 

9: SPLAY-MERGE(open-u, open-v) 

10: SPLAY-MERGE(open-u, close-u) 

11: end if 

12: end procedure 


4 


Thus, in order to maintain both down-dists and up-dists we need six distinct values in a node: a 
Aj,, and A, for down-dists, and a A^, A^, and A. for up-dists. 
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Algorithm 20 Implementation of cut 

1: procedure CUt(ii) 

2: root ROOt(-!;) 

3: if r 7 ^ root then 

4: A-|-(parent(ii)) A-f^(pARENT(t)) + sum A-|- in the subtree of v 

5: A 4 ,(w) A^(u) + sum of Aj, in the path of parent(u) 

6: open-v ■(— open-node of node v in the dft 

7: close-v close-node of node v in the dft 

8: open-root ^ open-node of root in the dft 

9: close-root ^ close-node of root in the dft 

10: SPLAY-SPLIT(SPLAY-PREDECESSOR(open-v)) 

11: SPLAY-SPLIT(close-v) 

12: SPLAY-MERGE(open-root, close-root) 

13: end if 

14: end procedure 


Algorithm 21 Implementation of condense 

1: procedure CONDENSe(?;) 

2: if V ^ ROOt(u) then 

3: A-|-(pARENT(ii)) = A-|-(pARENT(u)) - f A-|-(w) 

4: A4,(pARENT(u)) = Aj,(pARENT(u)) - f A4,(-!;) 

5: A.(parent(u)) = A,(parent(u)) — A_i,(-!;) 

6: end if 

7: open-v c— open-node of node v in the dft 

8: close-v close-node of node v in the dft 

9: SPLAY-ERASE(open-v) 

10: SPLAY-ERASE(close-v) 

11: end procedure 













